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DESIGN  AND  OPERATION  OF  A  MULTICOMMODITY 
PRODUCTION/DISTRIBUTION  SYSTEM  USING 
PRIMAL  GOAL  DECOMPOSITION* 

GERALD  G.  BROWN,  GLENN  W.  GRAVES  and  MARIA  D.  HONCZARENKO 
Department  of  Operations  Research,  Naval  Postgraduate  School,  Monterey, 

California  93943 

Graduate  School  of  Management,  University  of  California,  Los  Angeles,  California  90024 
Nabisco  Brands,  Inc.,  East  Hanover,  New  Jersey  07936 

An  optimization-based  decision  support  system  has  been  developed  and  used  by  NABISCO 
to  manage  complex  problems  involving  facility  selection,  equipment  location  and  utilization, 
and  manufacture  and  distribution  of  products  such  as  the  familiar  Ritz  Crackers,  Oreo  Cookies, 

Fig  Newtons,  etc.  (all  product  names  trademarks  of  NABISCO).  A  mixed-integer,  multi-com¬ 
modity  model  is  presented  for  the  problems  at  hand,  and  a  new  class  of  goal  decompositions  is 
introduced  to  yield  pure  network  subproblems  for  each  commodity;  the  associated  master 
problems  have  several  notable  properties  which  contribute  to  the  effectiveness  of  the  algorithm. 
Excellent  quality  solutions  for  problems  with  more  than  40,000  variables  (including  several 
hundred  binary  variables  with  fixed  charges)  and  in  excess  of 20,000  constraints  require  only  0.6 
megabytes  region  and  less  than  one  compute  minute  on  a  time-shared  IBM  3033  computer, 
average  problems  (with  fewer  binary  variables)  require  only  a  second  or  two.  The  solution 
method  has  more  to  recommend  it  than  sheer  efficiency:  new  insights  are  given  for  the  funda¬ 
mental  convergence  properties  of  formal  decomposition  techniques.  Several  applications  of  this 
powerful  interactive  tool  are  discussed. 

(PROGRAMMING— LARGE-SCALE  SYSTEMS;  PROGRAMMING— INTEGER,  APPLI¬ 
CATIONS;  FACILITIES/EQUIPMENT  PLANNING) 

1.  Introduction 

The  production/distribution  system  design  problems  considered  here  derive  from 
current  operations  of  the  Biscuit  Division  of  Nabisco  Brands,  Inc.  Nabisco  bakeries 
produce  several  hundred  products  for  nationwide  distribution  (e.g.,  Ritz  Crackers,  Oreo 
Cookies,  Fig  Newtons,  etc.).  Production  takes  place  in  batches,  each  of  which  involves  a 
relatively  brief  setup  followed  by  a  continuous  run  of  product.  The  continuous  produc¬ 
tion  run  involves  two  key  operations:  baking,  in  which  raw  ingredients  are  fed  contin¬ 
uously  into  an  oven,  and  secondary  operations,  such  as  sorting,  packaging  and  labeling 
finished  products.  Thus,  primary  facilities  (i.e.,  ovens)  and  secondary  facilities  (e.g., 
packing  lines)  must  be  operated  synchronously. 

Scheduling  and  operation  of  bakeries  is  a  complex  managerial  task.  Each  oven  is 
capable  of  producing  many  (but  not  all)  products  with  varying  efficiency.  While  baking 
a  particular  product,  each  oven  uses  a  designated  secondary  facility  for  that  product  as 
determined  by  the  physical  layout  of  the  bakery;  the  secondary  facility  may  be  shared 
with  other  ovens  in  operation  at  that  time.  Production  must  be  assigned  to  bakeries  so 
that  total  costs — manufacturing  and  transportation  costs — are  as  low  as  possible. 

Among  the  operational  issues  to  be  resolved  are: 

— Where  should  each  product  be  produced? 

— How  much  production  of  each  product  should  be  assigned  to  each  primary 
facility? 

— From  where  should  product  be  shipped  to  each  customer? 

Over  time,  new  products  are  introduced  and  customer  demand  shifts  for  each  prod- 

*  Accepted  by  Donald  Erlenkotter,  former  Departmental  Editor;  received  May  17,  1983.  This  paper  has 
been  with  the  authors  19  months  for  4  revisions. 
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uct  and  geographic  region.  Eventually,  it  becomes  necessary  to  build  new  plants  (or 
modify  or  close  existing  plants)  and  to  introduce  new  facilities  (or  move  or  sell  old 
ones).  The  strategic  issues  include: 

— Where  shall  plants  be  located? 

— What  kind  of  facilities  are  needed,  how  many  are  required,  and  where  should  they 
be  located? 

We  have  endeavored  to  answer  all  these  questions  by  developing  a  data  base  and 
decision  support  system  that  utilizes  a  large  embedded  optimization  model.  User- 
friendly  interactive  facilities  provide  rapid  response  to  operational  and  strategic  queries 
by  simple  editing  and  control  of  the  model  monolith.  Reasonable  model  resolution 
entails  approximately: 

100-200  Products  (product  groups), 

200-300  Facilities,  including 

80-100  primary  facilities  (ovens) 

120-200  secondary  facilities, 

10-20  plants  (bakeries),  and 
120-170  Customer  zones. 

Useful  response  time  necessarily  is  very  short.  Consequently,  a  great  deal  of  elfort  has 
been  invested  in  the  optimization  methods. 

In  the  sections  that  follow,  the  details  of  the  mathematical  model  are  presented,  a 
new  class  of  decompositions  is  introduced  and  applied,  a  mathematical  justification  for 
the  decomposition  is  presented,  and  some  general  insights  are  given  for  the  relative 
convergence  properties  of  classical  decomposition  methods  and  the  approach  devel¬ 
oped  here.  Finally,  we  give  several  examples  of  application  of  the  decision  support 
system. 


2.  Mathematical  Model 

Our  formulation  uses  the  following  notation: 

/  is  the  index  set  of  products; 

J  is  the  index  set  of  facilities; 

K  is  the  index  set  of  plants; 

L  is  the  index  set  of  customers. 

In  addition  to  these  primary  index  sets  we  require  some  additional  derived  index  sets. 
The  set  of  facilities  J  is  partitioned  into  primary  (baking)  facilities,  Ji ,  and  secondary 
facilities,  J2. 

The  set  of  activities  is  defined  by  the  combination  of  locations  and  facility  types: 

A  =  JXK=iJiXK)\J{J2y.K)  =  Ax  \JA2. 

The  activities  are  restricted  as  to  location,  and  A  =  AiU  A2(l  A  are  the  subsets  of 
allowable  combinations. 

The  set  of  production  elements  is  defined  by  the  combination  of  products  and  pri¬ 
mary  activities: 


F  =  /X4i  =/X  J,  XK 


The  production  elements  are  restricted  as  to  primary  activity,  and  PdP  'vs  the  set  of 
allowable  combinations.  Each  allowable  production  element  is  associated  with  a 
unique  secondary  activity,  as  defined  by  the  map  M:  P  -*  A2.  The  map  M  induces  a 
partition  of  P  according  to  the  secondary  activity  used: 

Ra^  =  {pd  P\Mlp)  =  az  G  A2}. 
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The  set  of  primary  activities  that  produce  a  given  product  /  G  /  is: 

Ti  =  (a,  GP). 

The  given  data  for  the  model  are: 

Du  demand  for  product  i  in  customer  zone  /, 

Sijk  capacity  for  product  i  on  facility  j  at  plant  k, 

Cijk  average  unit  cost  of  producing  product  i  on  facility  j  at  plant  k, 
fiki  unit  cost  of  shipping  product  i  from  plant  k  to  customer  /, 

Gjk  fixed  portion  of  the  annual  possession  and  operating  cost  for  facility  j  at  plant  k, 
Fk  fixed  portion  of  the  annual  possession  and  operating  cost  for  a  plant  at  site  k, 

Yijk  yield  of  product  i  on  primary  facility  j  at  plant  k, 

Uijk  rate  of  utilization  by  product  i  of  secondary  facility  j  at  plant  k, 

Cjk,  Cjk  minimum  and  maximum  utilization  of  facility  j  at  plant  k, 

Nk,  iV*  minimum  and  maximum  number  of  facilities  at  plant  k. 

The  variables  of  the  model  are: 

Xijki  the  amount  of  product  i  produced  on  facility  j  at  plant  k  shipped  to  customer  /, 
Sijk  the  amount  of  product  i  produced  on  facility  j  at  plant  k, 

Wjk  a  0-1  assignment  variable  of  facility  j  to  plant  k, 

Z*  a  0-1  close-open  variable  for  plant  k. 

The  problem  is  formulated  as  the  following  mixed  integer  linear  program: 
MINIMIZE  2  XijkiiCijk  +fiki)  +  2  GjkWjk  +  2  FkZk 

Xs,Pf',z  (iJJcieP  UJcyeA  ksK 

/EL 

SUBJECT  TO 


2  Xijki  -  Da,  i  G  7, 

(JJc)eTi 

/GL, 

(2.1) 

2  ^ijkl  ^  ^ijkj  ^  ^9 

/EL 

(2.2) 

CjkWjk<  2  SijklYijk<CjkWjk, 

(ij,k)sp 

{j,k)  =  ai^Ai, 

(2.3) 

CjkWjk  <  2  UijkSijk  <  CjkWjk, 

(ij,k)eRa2 

U,k)  =  a2E:A2, 

(2.4) 

NkZk^  2  Wjk<NkZk, 

kGK, 

(2.5) 

2  Wjk=  1, 
keK 

j^J, 

(2.6) 

and  the  variable  bounds 

0  <  Xijki, 

(2.7) 

0  ^  Sijk  ^  Fijk, 

(2.8) 

Wjk={0,  1}, 

(2.9) 

II 

p 

(2.10) 

The  constraints  (2.1)  ensure  that  all  demand  is  met.  The  constraints  (2.2)  ensure  that 
products  shipped  are  produced.  The  constraints  (2.3)  and  (2.4)  are  multi-product  ca- 

pacity  restrictions  on  the  primary  and  secondary  activities.  The  constraints  (2.5)  limit 
the  assignment  of  facilities  to  plants.  The  constraints  (2.6)  ensure  that  a  facility  is 
assigned  to  only  one  plant. 

A  realistic  prototypic  problem  has  150  products,  218  facilities,  10  plants  and 
127  customer  zones.  The  number  of  production  elements  is  345  and  thus  there  are 
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345  X  127  =  43,8 1 5  Xyki  variables.  Adding  the  345  Syk  variables  and  the  10  Z*  variables, 
the  problem  has  a  total  of 44,388  variables.  There  are  150  X  127  =  19,050  constraints  of 
type  (2.1),  345  of  type  (2.2),  218  of  type  (2.3)  and  (2.4),  10  of  type  (2.5),  and  218  of  type 
(2.6)  yielding  a  total  of  19,841  constraints. 

3.  Decomposition 

Problems  of  dimension  19,841  X  44,388  can  prove  to  be  somewhat  intractable  when 
approached  in  a  direct  manner.  Examination  shows,  however,  that  separating  the  Xyki 
variables  by  fixing  Syk,  Wjk,  and  Z*  in  a  primal  decomposition  breaks  out  disjoint 
transportation  problems  by  product  and  leaves  a  relatively  modest  master  problem.  At 
the  nth  iteration,  we  have  the  following  subproblems,  one  for  each  i  G  7: 

(/th  subproblem  at  nth  iteration) 

MINIMIZE  2  XijkiiCijk-^fud) 

^  U,k)eP 

SUBJECT  TO 


II 

b 

1  ^L, 

(3.1) 

2  ^ijk!  ^  Syk, 

(j,  k)  G  P, 

(3.2) 

tez. 


and  the  variable  bounds  (2.7). 

The  supplies  Syk  in  (3.2)  for  the  nth  iteration  are  obtained  from  a  master  problem. 
The  master  problem  at  the  nth  iteration  is  given  as  follows,  where  Ic  is  the  best  known 
solution  value  for  the  entire  problem  through  the  nth  iteration.  Let  Vy  be  the  dual 
variables  associated  with  the  constraints  (3.1)  and  Uyk  be  the  dual  variables  associated 
with  the  constraints  (3.2). 

(Master  Problem) 

MINIMIZE  2  GjkWjk+^FkZk 

s,W,Z  (j,k)eA  ksK 

Subject  to: 


2  Syk> 'Z,  F>ii,  iE.1,  (3.3) 


u,mTi  teL 

GjkWjk^  2  SykIYyk  <  CjkWjk, 

(ij,k)eP 

(7, /:)  =  «!  G^, 

(2.3) 

CjkWjk  <  2  UykSyk  ^  CjkWjk, 

UJ,k)eRa2 

(7,  k)  =  flz  e  Tz, 

(2.4) 

NkZk^ZWjk<NkZk, 

jeJ 

k  GK, 

(2.5) 

2  Wjk=\, 
kSK 

j  eT, 

(2.6) 

2.  8 

^ijk  ~  ^ijki 

(/,7,  k)  G  P, 

(3.4) 

2  UykSyk  +  2  GjkWjk  +  2  FkZk  <  7c  +  2  VuDy  —  €,  t  <  H,  (3.5) 

(U,k)sP  ikeA  keK  iel 

leL 


and  the  variable  bounds  (2.8)-(2.10). 

The  constraints  (3.3)  are  added  to  ensure  sufficient  supply  for  feasibility  of  the 
subproblem  at  each  iteration.  The  cuts  (3.5)  are  developed  in  the  next  section.  The 
constraints  (3.4)  are  the  really  novel  feature  of  this  formulation.  They  are  a  set  of 
production  goals  used  to  provide  additional  information  in  the  allocation  of  the  activity 
capacity  across  the  various  products.  The  symbol  =  indicates  that  each  may  be  violated 
at  a  small  linear  penalty  cost.  The  penalty  cost  must  be  small  enough  to  ensure  that 
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these  “goals”  will  be  sacrificed  as  needed  to  satisfy  the  other  relatively  stifFer  con¬ 
straints. 

In  this  model  the  goals  are  obtained  initially  by  solving  the  subproblems  with 
s?jk  =  Sijk,  the  individual  production  capacities,  while  ignoring  the  multi-product  capac¬ 
ity  restrictions  (2.3),  (2.4).  Since  these  goals  are  obtained  as  a  relaxation  of  the  original 
problem,  this  provides  a  lower  bound  on  the  value  of  the  subproblems. 

Typically  the  most  desirable  activities  will  be  overutilized  by  the  subproblems  and 
exceed  their  joint  product  capacities  as  specified  in  the  constraints  (2.3),  (2.4).  There¬ 
fore  the  actual  subproblem  use  (2/e£  TT^w)  is  scaled  down  proportionally  by  product  to 
equal  exactly  the  joint  product  capacities  Q*.  This  is  done  after  a  heuristic  has  been 
used  to  select  a  good  initial  configuration  of  assignments  of  facilities  to  plants.  Each 
unassigned  facility  is  assigned  to  the  compatible  plant  from  which  the  subproblems 
have  drawn  the  highest  total  proportions  of  demands  for  products  made  there. 

The  use  of  decomposition  goals  profoundly  influences  the  rate  of  convergence 
achieved  with  decomposition. 

Dual  decomposition  models  with  goals  have  been  suggested  by  Ruefli  (1971)  as  a 
purely  conceptual  tool  for  interpretation  of  hierarchical  organizational  behavior.  Sub¬ 
sequently,  Freeland  (1976)  discussed  primal  decomposition  goal  models  in  the 
same  vein. 


4.  Justification  of  the  Decomposition 
Given  the  mathematical  program 

MP  min  wy  =  Wiyi  +  ^2^2 
s.t.  Ay  =  A' yi+  A^y2  ^  r, 

->;<o,  T2er, 

which  becomes  a  linear  programming  problem  given  fixed  values  for  y2,  the  problem 
may  be  solved  sequentially  as  a  function  of  y^: 

LP(j;2)  min  Wiyi  (+W2  3^2) 
s.t.  A'yi<r-A^  y2, 

-yi  0. 

Let  Fp  be  the  set  of  primal  feasible  solutions  y^iyuVi)  of  MP.  Let  R(y2)  be  the  set  of 
fiyi)  =  r  —  A^y2  that  yield  a  finite  optimal  solution  to  LP(j;2).  Also  let  Fp(y2)  be  the  set  of 
primal  feasible  solutions  yi(y2)  of  LP(j;2)  and  viyi)  be  the  optimal  value  of  LP(j;2)- 
Let  v{y2)  =  v(y2)  +  W2y2-  Now  (3^1,  3^2)  S  Fp  yields  a  y  such  that  r(3^2)  ^  ^(>’2)  and 
yi  G  ^^(3^2)  and  conversely,  and  therefore  every  feasible  solution  of  MP  yields  a  feasible 
solution  in  the  class  of  LP  problems  LP(3;2)  and  conversely.  The  optimal  value  of  MP  is 

min;^e«(w)  ^y2)- 

Let  the  dual  of  the  relaxation  of  MP,  deleting  the  restriction  3;2  G  P,  and  the  dual  of 
LP(3;2)  be  respectively 

DP  max  xr  DP(y2)  max  x(r  -  A^y2)  (+W2y2) 

s.t.  xA  <  w,  s.t.  xA‘  <Wi 

X  <0,  X  <0, 

with  corresponding  feasible  solution  sets  Fj  and  Fdiyi)- 


1474  GERALD  G.  BROWN,  GLENN  W.  GRAVES  AND  MARIA  D.  HONCZARENKO 


Let  ^(>’2)  be  an  optimal  solution  for  DP(j;2)  and  then 

Viyi)  =  Hyi)  +  ^2^2  =  X(y2)r(y2)  +  W2>^2 
=  x(y2)[r  -  A'^y2]  +  W2>^2 
=  x(y2)r  +  [W2  -  x(y2)A^]y2. 

Now  v(y2)  as  a  function  of  r(y2)  can  be  strongly  characterized.  The  function  t;(>’2)  is  a 
continuous,  convex,  piecewise  linear  function.  Using  the  results  that  a  continuous 
function  of  a  continuous  function  is  also  continuous,  and  that  the  sum  of  convex 
functions  is  convex,  ^(^2)  is  also  a  continuous,  convex,  piecewise  hnear  function. 

If  the  convex  function  v(y2)  were  also  continuously  differentiable,  we  could  apply  the 
classical  result  about  convex  functions  that  v(y2)  >  +  ’^v(y2)(y2  —  T2)  to  obtain  a 

linear  lower  bound  function  that  would  provide  necessary  conditions  for  i;(j;2)  <  viy^). 
Although  v(y2)  does  have  discontinuities  in  its  first  derivatives,  a  linear  lower  bound 
function  is  still  easily  obtainable.  For  any  xE.Fd,yEFp,by  the  weak  duality  theorem 
of  linear  programming, 

xr  <  xr  +  (tv  —  xA)y  =  wy  +  x[r  —  Ay]  <  wy, 

since 

xGFrf=>x<0,  {w-xA)>0, 

yEiFp=^y>Q,  {r- Ay)  >0. 

Also, 

(w  -  xA)y  =  [wi  -  xA']yi  +  [W2  -  xA^]y2. 

Restricting  ourselves  to  x(y2)  G  Ft/(y2),  yE.Fp  for  a  fixed  value  y2,  [wj  -  x{y2)A']yi  >  0, 
and  therefore, 

Kyi,  yi)  ^  x(y2)r  +  [W2  -  x{y2)A^\y2 

=  wy  +  x(j;2)[^  -  Ay]  -  [w,  -  xiy2)A  ‘]yi 
<  wy  +  x(y2)[^  —  Ay]  <  wy. 

The  linear  lower  bound  function  Kyi',  T2)  =  x(y2)r  +  [W2  -  x(y2)A^]y2  =  t)(y2)  at 
yi  =  yi  by  direct  comparison  with  the  expression  for  v(y2).  (This  is  an  indirect  conse¬ 
quence  of  the  complementary  slackness  requirement  that  [wi  —  x(y2M*]yi(y2)  =  0.) 

The  class  of  linear  lower  bound  functions  /(yi',  fi)  parameterized  by  y2  can  be 
employed  to  generate  a  sequence  {y^}  such  that  t)(y2)  converges  to  the  minimum  of 
t;(y2).  It  is  a  necessary  condition  for  an  e  improvement  over  the  best  known  incumbent 
value  viyi)  that  y2  satisfy  the  linear  inequalities, 

(Master  Problem  cuts) 

Kyi',  yi)  =  x(yi)r  -t-  [W2  -  x(y‘2)A^M  <  ’’(yi)  t<n. 

(Inequalities  (3.5)  are  an  instance  of  this.) 

When  the  master  problem  is  infeasible  ^(^2)  >  ’Kyi)  ~  «  and  ^(yl)  is  an  e-optimal 
value,  since  any  y2  such  that  t;(y2)  <  viyi)  “  «  would  satisfy  /(y2 ;  yi)  <  t)(y2)  <  ’Kyi)  ~  « 
and  be  a  feasible  solution. 

Theorem.  Assume  t;(y2)  is  bounded  below.  Any  sequence  {^(yS)}  generated from  yl 
that  satisfies  the  master  problem  will  terminate  in  a  finite  number  of  steps  at  an  e-opti¬ 
mal  value. 

Proof.  Suppose  the  sequence  does  not  terminate  in  a  finite  number  of  steps  and 
yield  an  e-optimal  value.  Each  y”  generates  an  x(y2)  which  is  a  basic  optimal  solution  of 
DP(y2).  There  are  only  a  finite  number  of  bases  T\k)  for  DP(y2)  since  T{k)  is  not  a 
function  of  y2.  Every  x(y2)  satisfies  T(k)x(y2)  =  Wi  for  some  k,  and  since  Wi  is  also 
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independent  of  y2  there  are  only  a  finite  number  of  x^y^).  Therefore  let  s  be  the  first 
index  such  that  x(y2)  =  xiyi)  for  t  <s.  Then 

viyi)  =  Kyi',  yi)  =  Kyi,  yi)  <  i^iyi)  - 

and  v(y2)  becomes  the  new  incumbent.  Since  e  is  strictly  positive  v(y2)  will  violate  its 
lower  bound  in  a  finite  number  of  steps.  Q.E.D. 

An  analogous  development  is  given  for  dual  decomposition  and  hybrid  approaches 
by  Graves  and  Van  Roy  (1979). 

Although  any  choice  of  y2  satisfying  the  master  problem  will  ensure  finite  conver¬ 
gence,  the  rate  of  convergence  is  another  matter.  A  two-dimensional  representation  of 
v(y2)  as  a  piecewise  linear  convex  function  provides  insight  into  the  convergence  pro¬ 
cess.  (See  Figure  1 .) 

The  convexity  of  v(y2)  implies  that  the  slopes  of  the  linear  lower  bound  functions 
Kyi',  yi)  are  small  in  the  vicinity  of  the  optimum  and  monotonically  increase  with  y2. 
The  rate  of  change  of  Kyi',  yi)  is  very  sharp  when  remote  from  the  optimum,  and  when 
\yi  ~  yT'\  is  not  strongly  bounded  an  interminable  oscillation  can  (and  does)  occur 
between  the  wings.  Any  strong  minimization  of  the  Kyi',  yi)  such  as  the  customary 
decomposition  technique  (Benders  1962): 

(Customary  Master  Problem) 

min  V 

s.t.  Kyi',yi)<v, 

seems  to  force  this  type  of  oscillation  and  exhibit  very  weak  convergence;  the  cuts 
introduced  here  are  much  less  prone  to  this  unruly  behavior,  and  admit  a  rich  diversity 
of  coercion  via  the  objective  function.  Any  heuristic  that  will  yield  a  good  starting  r(>’°) 
should  be  very  beneficial.  (Note  r(y°)  =  r  —  A^y2  =  r  can  be  solved  for  y2  as  a  goal 
program.)  The  piecewise  linearity  shows  that  Kyi',  yi)  =  vCy”)  in  a  sufficiently  small 


Figure  I .  Graph  of  11(^2)  and  /( V2 ;  V2)  for  V2  G 
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neighborhood  of  y"  (until  a  basis  change).  In  general  since  l{y2 ;  y")  is  determined  by  the 
dual  variables  x^y"^,  which  are  the  derivatives  of  ^(>’2)  with  respect  to  r(>’2),  it  is  a  good 
approximation  to  v(y2)  only  in  a  small  neighborhood  of  ^2-  To  ensure  stable 
and  virtually  monotonic  convergence  it  is  usually  necessary  to  impose  local  bounds 
\y2  ~  yT^\  <  b  on  each  step  of  the  process.  Computational  experimentation  has 
strongly  confirmed  these  theoretical  insights. 

The  view  of  decomposition  as  the  minimization  of  a  nonlinear  function  presented 
here  is  closely  related  to  what  has  become  known  as  “Lagrangian  Relaxation”  (e.g., 
Geoffrion  1974),  especially  in  the  context  of  “subgradient  optimization”.  However,  in 
subgradient  optimization  only  the  last  ((>’2;  >^2)  is  used.  (To  see  this,  note  that  the 
Lagrangian  Relaxation  of  DP  leads  to  a  subgradient  problem  using  ((>’2,  yi)  as  the 
objective  goal  for  DP  (y^)-)  Since  it  is  a  necessary  condition  that  all  the  inequalities  of 
the  Master  Problem  be  satisfied  in  order  to  achieve  a  gain  at  a  given  step,  the  neglect  of 
all  but  the  last  inequality  greatly  complicates  and  weakens  convergence.  An  observant 
reader  will  find  precisely  this  unstable  behavior  in  the  numerical  examples  of  Held, 
Wolfe,  and  Crowder  (1974). 

The  decomposition  theory  advocated  here  also  readily  assimilates  the  use  of  goals. 
Using  a  3-level  hierarchy  of  linear  penalties,  a  high  penalty  for  all  noncut  and  nongoal 
constraints,  a  medium  penalty  for  the  cuts  (e.g.,  (3.5)),  and  a  low  penalty  for  the  goals 
(e.g.,  (3.4))  retains  all  the  essentials  for  convergence  by  ensuring  that  the  constraints  will 
violate  in  the  same  hierarchical  order.  The  violation  of  the  goal  constraints  at  each  step 
is  ignored.  When  a  violation  of  the  cut  constraints  (3.5)  occurs  a  new  e-optimal  bound  is 
achieved.  The  best  procedure  in  goal  incorporated  decomposition  is  to  use  a  decreasing 
sequence  e,  >  €2  >  €3  •  •  •  of  optimality  tolerances  and  a  decreasing  sequence  bi  >  Z?2 
>  ^>3  •  •  •  of  neighborhood  bounds.  The  tolerance  level  e,  is  reduced  when  cut  infeasibi¬ 
lity  occurs.  The  local  bound  6,  is  reduced  at  each  break  in  the  monotonic  decrease  of 


StepO:  Initialization 

(,  c/ initial,  minimal  convergence  tolerances 

5,  ^/initial,  minimal  trust  regions 

n  =  0,  initial,  final  numbers  of  cuts 

Incumbent  value  =  Large  number 

Configuration  Limitations  (some  fixed  assignments  V2) 

Step  1:  Initial  Configuration  and  Decomposition  Goals 
Heuristic  assignment  of  all  of  V2 
Set  initial  decomposition  goals 
Step  2:  Solve  Master  Problem 

If  feasible,  ignoring  goals,  go  to  Step  3 
Else  if  n  =  0  stop  infeasible. 

Else  if  c  <  c/go  to  Step  4 

Else  reduce  convergenee  tolerance  «  and  repeat  Step  2. 
Step  3:  n  =  n  +  1 

Solve  Subproblems 

Incumbent  test  for  improved  solution  value 
If  no  improvement  and  &  >  6/ reduce  trust  region  5 
Step  4:  If  n>  n/go  to  Step  5 

If  feasible,  ignoring  goals,  go  to  step  2 
Else  if  initial  heuristic  assignment  still  in  force 
Relax  heuristic  assignments 
Increase  convergence  tolerance  « 

Increase  trust  region  S 
Go  to  Step  2 
Else  go  to  Step  5 
Step  5:  Termination 


Figure  2.  A  Skeletal  Example  of  a  Primal  Goal  Decomposition  Algorithm. 
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v(y2).  Moderation  is  a  virtue  in  the  selection  of  the  tolerance  levels  e,  .  They  are  in  effect 
aspiration  levels,  and  greed  forces  large  departure  from  the  current  neighborhood  where 
the  dual  local  derivatives  x(y2)  have  some  validity. 

Figure  2  depicts  the  overall  operation  of  an  instance  of  the  entire  decomposition 
algorithm  developed  here.  The  local  trust  region  for  continuous  variables  in  the  master 
problem  can  be  managed  and  a  fashion  reminiscent  of  nonlinear  programming,  with 
movement  limited  to  a  fraction  of  the  current  values  (e.g.,  the  neighborhood  bounds  b„ 
initially  derive  from  (2.8)  and  the  restriction  fraction  parameter  S). 

5.  Computational  Experience 

The  decomposition  has  been  implemented  using  the  X-system  (Brown  and  Graves 
1975)  for  problem  generation,  coordination  and  master  problem  solution,  and  em¬ 
ploying  GNET  (Bradley,  Brown,  and  Graves  1977)  for  solving  the  pure  network  sub¬ 
problems. 

The  X-system  intrinsically  incorporates  and  exploits  goal  constraints,  so  that  the 
implementation  of  hierarchical  penalties  is  very  easy  and  the  solution  performance  is 
good.  GNET  (ca.  1981)  is  even  more  efficient  for  the  network  problems  than  its 
progenitor  (ca.  1974). 

We  were  particularly  interested  in  the  actual  behavior  of  the  decomposition  in  light 
of  the  theoretical  evidence.  Accordingly,  early  experiments  with  real-life  data  were  run 
with,  and  without  the  goal  constraints. 

Without  the  use  of  the  goals  (3.4),  30  iterations  could  not  produce  a  solution  of  the 
prototypic  problem  within  4  million  (annual)  dollars  of  the  optimum.  Using  the  goals 
the  initial  solution  was  3.5  million  dollars  better  than  the  final  solution  without  the 


Step  O'.  Initialization 

Cost  Conversion  (master-to-subproblems) 

Activity  Conversion  (master-to-subproblems) 

High  Penalty  (demand  (3.3),  facility  assignment  (2.5,  2.6)) 
Medium  Penalty  (production  (2.3,  2.4),  cuts  (3.5)) 

Low  Penalty  (supply  goals  (3.4)) 

Initial,  minimal  convergence  tolerance  (e,  ej) 

Initial,  minimal  trust  region  (£,  Sj) 

Configuration  Limitations 

Step  1:  Initial  Configuration,  initial  supply  goals,  production  goals 
Solve  subproblems  with  Syit  =  available  Sijk 
Set  production  goals  s^k  to  flows  'Ll  X/jki 
scaled  equal  to  product  capacities  Cjk 
If  all  facilities  assigned,  go  to  Step  2 
Unassigned  facilities  assigned  to  plants 
Find  production  capacities 
Solve  subproblems  with  S%k  =  available  S^k 
Set  production  goals  5  to  flows  2/  Xtjki 
scaled  equal  to  product  capacities  Cjk 

Step  2'.  Solve  Master  Problem  (with  5  J/t) 

If  €  can  be  reduced,  «  =  max  {c/2,  c/} 

Step  3:  Solve  Subproblems  with  s"jk  =  Sijk 
if  S  can  be  reduced,  5  =max  (5/2,  5/} 

Step  4;  If  heuristic  facility  assignments  relaxed 
increase  trust  region  c  =  4c/ 
increase  convergence  tolerance  5  =  45/ 

Step  5:  Termination 


(e.g.) 

($millions-to-mils,  10^) 
(10^) 
(100.0) 
(20.0) 
(0.5) 
(0.2,  0.05) 
(0.05,  0.0001) 


(fix  some  Wjk,  Zk’s) 


(lower  bound  on  v(MP)) 


(fix  all  Wjk,  Zk's) 


(for  Sijk,  Wjk,  Zk) 
(reduce  c) 

(for  Xijki) 
(reduce  5) 

(free  those  Wjk,  Zk’s) 
(increase  c) 
(increase  5) 


Figure  3.  Example  Details  for  Model  at  Hand. 
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goals.  Further,  the  initial  solution  (with  goals)  was  refined  with  12  iterations  to  yield  a 
further  gain  of  0.466  million  dollars. 

Although  the  solution  without  the  goals  was  within  about  1%  of  the  optimum  (which 
might  seem  an  acceptable  approximation),  it  is  largely  the  ability  of  mathematical 
models  to  achieve  these  final  refinements  that  justifies  their  use. 

The  solution  of  this  relatively  difficult  initial  prototypic  problem  on  an  IBM  3033 
using  FORTRAN  IV  H  (Extended)  with  OPTIMIZE  (2)  required  64  seconds  and  0.6 
megabytes  region. 

The  goals  and  hierarchical  linear  penalties  can  be  generated  in  many  ways,  providing 
a  rich  experimental  arena.  We  have  tested  static  goals  induced  from  initial  capacity 
estimates,  and  dynamic  goals  derived  from  subproblem  solutions  with  capacities  fixed 
by  preceding  master  problem  solutions.  Static  penalties  for  the  goals  have  been  com¬ 
pared  with  dynamic  asymmetric  penalties  determined  by  a  heuristic  which  examines 
shortages  and  excess  capacity  in  successive  master  problem  solutions.  The  combination 
of  static  goals  and  dynamic  penalties  has  performed  best. 

Flowever,  it  is  more  significant  that  in  our  experience  any  goal-penalty  combination 
exhibits  profound  improvement  over  classical  decomposition.  This  robustness  gives 
compelling  evidence  that  goal  decomposition  is  more  effective  in  dealing  with  the 
complications  of  infeasibility  than  classical  decompositions. 

Figure  3  shows  some  of  the  details  specific  to  the  NABISCO  problem  in  the  context  of 
the  algorithm  steps  in  Figure  2.  The  cost  and  activity  conversion  factors  serve  for 
translation  of  units  between  the  mixed  integer  master  problems  and  the  pure  integer 
network  subproblems;  the  other  penalties  and  tolerances  inherit  their  units  from  these 
conversion  factors. 


6.  Managerial  Experience 

The  particular  decision  support  system  developed  for  Nabisco  Brands  has  been 
named  BPDOS — Riscuit  Rroduction-Zfistribution  Optimization  System.  In  actual  de¬ 
cision-making  situations,  BPDOS  has  proven  itself  to  be  the  management  support 
system  it  was  conceived  and  built  to  be.  The  thorough  evaluation  of  the  facility  plan¬ 
ning  issues  that  Nabisco  Brands  must  address  on  a  frequent  basis  has  been  enhanced 
significantly.  The  top  managers,  who  ultimately  must  make  the  decisions  about  closing 
old  bakeries,  introducing  new  technology  or  products,  and  moving  production  facilities 
from  one  bakery  to  another,  now  can  do  so  with  fully  integrated  and  rapidly  available 
information  on  the  cost  and  capacity  implications  of  those  decisions. 

For  example,  BPDOS  has  been  used  to  help  management  analyze  the  operational 
and  financial  implications  of  closing  an  old  bakery  which  was  inefficient  by  today’s 
standards  in  materials  and  product  flows.  Additionally,  higher  than  average  mainte¬ 
nance  costs  were  eroding  the  profit  margins  of  products  made  in  that  bakery.  The 
question  was, 

“Can  the  products  made  in  the  old  bakery  be  produced  elsewhere  within  existing  capacity,  and, 
if  so,  at  what  cost?” 

Through  a  series  of  BPDOS  runs,  the  production  planning  analysts  were  able  to  demon¬ 
strate  to  top  management  how  the  production  capacity  lost  in  closing  one  location 
could  be  “made-up”  among  those  bakery  locations  that  would  remain.  The  realigned 
production-distribution  system  showed  which  products  should  be  produced  on  each 
combination  of  facilities  for  each  of  the  roughly  160  branches  (“customers”)  to  mini¬ 
mize  total  production  and  distribution  costs. 

Another  application  of  BPDOS  has  been  in  the  analysis  of  equipment  requirements 
to  convert  all  Ritz  cracker  production  to  “slug”  packs  vs.  “dump”  packs.  In  the  tradi¬ 
tional  dump  pack,  the  crackers  are  loose  inside  the  box.  With  the  slug  pack,  the  crackers 
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are  stacked  in  three  or  more  columns,  and  each  column  is  wrapped  separately  in  waxed 
paper.  Through  a  series  of  BPDOS  runs  the  production  planning  analysts  were  able  to 
tell  top  management  what  additional  equipment  would  be  required,  and  in  which 
locations,  to  convert  fully  to  slug-type  packaging. 

Yet  a  third  application  of  BPDOS  has  been  in  the  roll-out  planning  for  new  prod¬ 
ucts.  Given  market  forecasts  over  five-  to  ten-year  planning  horizons,  BPDOS  can  show 
how  new  production  capacity  should  be  introduced  over  time  to  minimize  current 
production  and  distribution  costs.  When  markets  develop  for  new  products,  as  reflected 
in  increased  demand,  production  facilities  must  be  rebalanced  to  get  the  optimal  facility 
utilization  mix  at  any  point  in  time. 

The  above  kinds  of  facility  planning  issues  have  always  existed,  and  they  were  not 
ignored  prior  to  the  implementation  of  BPDOS.  However,  prior  to  BPDOS,  each 
planning  scenario  took  about  three  days  to  work  up  manually;  given  the  size  of  most 
scenarios,  fully  integrated  and  consistent  evaluations  could  not  be  guaranteed,  much 
less  (nearly)  optimal  solutions.  Using  BPDOS,  the  scenario  evaluation  time  has  de¬ 
creased  to  half  a  day,  most  of  which  is  devoted  to  loading  new  data,  such  as  demand 
forecasts.  This  83%  reduction  in  time  to  carry  out  a  scenario  analysis  is  significant  since 
upwards  of  50  such  “What  If’  exercises  may  be  done  each  year.  That  translates  to 
saving  more  than  half  a  man-year  of  a  highly  experienced  production  planner;  and  that, 
in  turn,  translates  to  a  cost  reduction  of  over  $25,000. 

In  terms  of  actual  computational  time,  the  average  BPDOS  problem  has  taken  2-3 
CPU  seconds  on  an  IBM  3033  with  VM/CMS.  The  most  difficult  problems  seldom 
require  longer  than  60  CPU  seconds.  For  large  problems,  the  actual  turnaround  time 
for  the  analyst  sitting  at  a  display  terminal  has  been  about  three  minutes  per  scenario. 
Including  all  the  data  file  definitions  and  full-screen  prompting  menus  that  have  been 
set  up  for  BPDOS  in  the  CMS  operational  environment  at  Nabisco  Brands,  the  region 
requirements  are  just  under  0.75  megabyte. 

7.  Conclusions 

Although  our  model  has  a  strategic  perspective — fixed  charges  and  binary  decision 
variables  contribute  much  of  the  computational  optimization  workload — it  is  surpris¬ 
ing  that  significant  savings  are  achieved  just  by  resolution  of  production  and  transporta¬ 
tion  costs.  Even  for  a  fixed  configuration  of  plants  and  facilities,  the  optimal  assignment 
of  production  is  a  subtle  affair,  capable  of  producing  remarkable  cost  reductions. 

The  new  solution  methodology  has  revolutionized  our  thinking  about  decomposi¬ 
tion  and  relaxation  methods.  Many  years  of  computational  experience  have  convinced 
us  that  this  new  class  of  decompositions  has  much  to  recommend  it.‘ 

'  We  are  indebted  to  Messrs.  Les  Brewster  and  A1  Frank  of  Nabisco  Brands,  Inc.,  for  their  support  of  this 
research  project. 

Dr.  Richard  Powers  of  Insight,  Inc.,  has  been  of  invaluable  assistance  in  bridging  the  significant  gap 
separating  sound  theory  and  routine  application. 

A1  Washburn  has  suggested  several  clarifications,  and  Rick  Rosenthal  has  generously  contributed  improve¬ 
ments  in  style  and  presentation. 

This  paper  was  originally  presented  at  ORSA/TIMS,  Houston,  October  14,  1981,  under  the  title  “Large- 
Scale  Facility  and  Equipment  Location:  An  Application  of  Goal  Programming  in  Multicommodity  Decom¬ 
position”. 
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